Introduction
Hyperinflammatory response induced by the SARS-CoV19 (CV) coronavirus is the main cause of morbidity and mortality. Numerous studies have pointed-out the main role of monocyte activation. In addition neutrophils alterations appear to differ pathophysiologically from the changes that occur in Influenza Virus (IV) infection. Due to the overlap of symptoms between these two entities, the search of analytical markers that help in early diagnostic orientation is considered of crucial importance. Changes in cell function, phenotype, and morphology in circulating leukocytes can be translated into numerical data obtained from an automated analyzer. The objective of our study is to generate an Artificial Intelligence Model from conventional hematological blood count parameters which be able to discriminate between CV and IV infection, in a fast and efficient maner.
Methods
This is a retrospective single-center study, performed between January-April 2020. The patients (n = 816) were divided into two groups: Patients who came for suspected COVID and had a positive RT-PCR (n = 408) and patients with a diagnosis of influenza confirmed by RT-PCR (n = 408). The database was divided into two random subgroups (n = 654) to train the model and another (n = 162) to validate it.
The first hemogram on admission to the Emergency Department of these patients was performed on a Beckman-Coulter® DXH-900 equipment. Total white blood cells, absolute neutrophils, absolute lymphocytes, absolute monocytes, monocyte distribution wide (MDW) and Cell Morphological Data (CMDs) based on the impedance, conductivity and light scattering of these leukocyte subpopulations have been used to construct the model.
Five algorithms have been evaluated using the R studio Software and the Caret (Classification and Regression Training) package: Linear Discriminant Analysis (LDA), K-Nearest Neighbor (kNN), Neural Networks (NN), Support Vector Machines (SVM) and Recursive partitioning (Rpart).
Results
The evaluation of the different models was based on the comparison of the efficacy obtained through a cross validation (10x). It was decided to choose the SVM model by presenting a median of the area under the ROC curve of 0,841. No data preprocessing was performed, and the parameters chosen for the model were: sigma = 0,014, C = 1 and Number of Support Vectors = 458.
Parameters with greater importance (>80%) in the model, were CMDs based on Neutrophil Light Scattering (SDLNE, SDLAN, SDMNE and MNLNE).
The analysis of results was performed using a confusion matrix, where the model predicts the diagnosis of each patient in the validation subgroup (Table 2). A ROC curve with an area of 0,892 was obtained, with a sensitivity and specificity of 80% and 85%, respectively (Fig 1).
Conclusions
The creation of prediction algorithms from hemogram parameters allow to discriminate between COVID 19 infection and influenza A and B with a high specificity and sensitivity in a fast way. This could be a great advance for the early diagnostic orientation and guide clinical decisions as soon as possible with the consequent clinical benefit.
No relevant conflicts of interest to declare.
Author notes
Asterisk with author names denotes non-ASH members.
This feature is available to Subscribers Only
Sign In or Create an Account Close Modal